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Abstract 

Background:  Like  human  immunodeficiency  virus  (HIV),  ovine  lentivirus  (OvLV)  is  macrophage-tropic  and  causes  lifelong 
infection.  OvLV  infects  one  quarter  of  U.S.  sheep  and  induces  pneumonia  and  body  condition  wasting.  There  is  no  vaccine 
to  prevent  OvLV  infection  and  no  cost-effective  treatment  for  infected  animals.  However,  breed  differences  in  prevalence 
and  proviral  concentration  have  indicated  a  genetic  basis  for  susceptibility  to  OvLV.  A  recent  study  identified  TMEM154 
variants  in  OvLV  susceptibility.  The  objective  here  was  to  identify  additional  loci  associated  with  odds  and/or  control  of 
OvLV  infection. 

Methodology/Principal  Findings:  This  genome-wide  association  study  (GWAS)  included  964  sheep  from  Rambouillet, 
Polypay,  and  Columbia  breeds  with  serological  status  and  proviral  concentration  phenotypes.  Analytic  models  accounted 
for  breed  and  age,  as  well  as  genotype.  This  approach  identified  TMEM154  (nominal  P  =  9.2x10-7;  empirical  P  =  0.1 3), 
provided  12  additional  genomic  regions  associated  with  odds  of  infection,  and  provided  13  regions  associated  with  control 
of  infection  (all  nominal  P<1x1CT5).  Rapid  decline  of  linkage  disequilibrium  with  distance  suggested  many  regions 
included  few  genes  each.  Genes  in  regions  associated  with  odds  of  infection  included  DPPA2/DPPA4  (empirical  P  =  0.006), 
and  SYTL3  (P  =  0.051).  Genes  in  regions  associated  with  control  of  infection  included  a  zinc  finger  cluster  (ZNF192,  ZSCAN16, 
ZNF389,  and  ZNF165 ;  P  =  0.001),  C19orf42/TMEM38A  (P  =  0.047),  and  DEC  API  (P  =  0.092). 

Conclusions/Significance:  These  associations  provide  targets  for  mutation  discovery  in  sheep  susceptibility  to  OvLV.  Aside 
from  TMEM154,  these  genes  have  not  been  associated  previously  with  lentiviral  infection  in  any  species,  to  our  knowledge. 
Further,  data  from  other  species  suggest  functional  hypotheses  for  future  testing  of  these  genes  in  OvLV  and  other  lentiviral 
infections.  Specifically,  SYTL3  binds  and  may  regulate  RAB27A,  which  is  required  for  enveloped  virus  assembly  of  human 
cytomegalovirus.  Zinc  finger  transcription  factors  have  been  associated  with  positive  selection  for  repression  of  retroviral 
replication.  DLGAP1  binds  and  may  regulate  DLG1,  a  known  regulator  of  HIV  infectivity. 
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Introduction 

Ovine  lentivirus  (OvLV),  like  the  human  immunodeficiency 
virus  (HIV),  is  a  macrophage-tropic  lentivirus  that  leads  to 
persistent,  lifelong  infection  of  the  host  [1,2].  Seroprevalence  of 
OvLV  is  24—26%  in  U.S.  domestic  sheep,  with  positive  animals  in 
approximately  half  of  all  U.S.  flocks  [3,4],  Ovine  lentivirus 
infection  leads  to  varying  degrees  of  dyspnea  (respiratory  distress), 
cachexia  (body  condition  wasting),  mastitis,  arthritis,  and/ or 
encephalitis  [5,6].  One  of  the  most  commonly  observed  symptoms 
is  an  interstitial  pneumonia  [5, 6, 7, 8]  that  has  led  to  additional 
names  for  OvLV  including  ovine  progressive  pneumonia  virus  and 


maedi-visna  virus,  front  the  Icelandic  “maedi”  for  dyspnea.  As 
with  all  known  lentiviruses,  there  is  currently  no  known  way  to 
eliminate  OvLV  infection,  and  no  vaccine  to  completely  prevent 
infection  [9], 

However,  breed  differences  in  susceptibility  to  OvLV  suggest 
that  it  may  be  possible  to  breed  animals  with  reduced  susceptibility 
to  the  virus.  It  has  been  shown  that  reproducible  breed  differences 
exist  in  seroprevalence  of  OvLV  across  several  studies  done  in 
different  locations  over  many  decades  [10,11,12,13,14],  For 
example,  Rambouillet  sheep  had  lower  odds  of  infection, 
Columbia  sheep  had  higher  odds  of  infection,  and  the  recent 


PLOS  ONE  |  www.plosone.org 


1 


October  2012  |  Volume  7  |  Issue  10  |  e47829 


Report  Documentation  Page 

Form  Approved 

OMB  No.  0704-0188 

Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 

VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 

1.  REPORT  DATE 

17  OCT  2012  2' REPORT  TYPE 

3.  DATES  COVERED 

00-00-2012  to  00-00-2012 

4.  TITLE  AND  SUBTITLE 

Genome-Wide  Association  Identifies  Multiple  Genomic  Regions 

Associated  with  Susceptibility  to  and  Control  of  Ovine  Lentivirus 

5a.  CONTRACT  NUMBER 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

U.  S.  Department  of  Agriculture, Animal  Disease  Research  Unit, 
Agricultural  Research  Service, Pullman, WA, 99163 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

10.  SPONSOR/MONITOR'S  ACRONYM(S) 

11.  SPONSOR/MONITOR'S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release;  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 

PLoS  ONE,  vol  7,  no.  10,  Published:  October  17,  2012,  Government  or  Federal  Purpose  Rights  License 

14.  ABSTRACT 

Background:  Like  human  immunodeficiency  virus  (HIV),  ovine  lentivirus  (OvLV)  is  macrophage-tropic 
and  causes  lifelong  infection.  OvLV  infects  one  quarter  of  U.S.  sheep  and  induces  pneumonia  and  body 
condition  wasting.  There  is  no  vaccine  to  prevent  OvLV  infection  and  no  cost-effective  treatment  for 
infected  animals.  However,  breed  differences  in  prevalence  and  proviral  concentration  have  indicated  a 
genetic  basis  for  susceptibility  to  OvLV.  A  recent  study  identified  TMEM154  variants  in  OvLV 
susceptibility.  The  objective  here  was  to  identify  additional  loci  associated  with  odds  and/or  control  of 
OvLV  infection.  Methodology/Principal  Findings:  This  genome-wide  association  study  (GWAS)  included 
964  sheep  from  Rambouillet  Polypay,  and  Columbia  breeds  with  serological  status  and  proviral 
concentration  phenotypes.  Analytic  models  accounted  for  breed  and  age,  as  well  as  genotype.  This 
approach  identified  TMEM154  (nominal  P  =  9.261027;  empirical  P  =  0.13)  provided  12  additional  genomic 
regions  associated  with  odds  of  infection,  and  provided  13  regions  associated  with  control  of  infection  (all 
nominal  P, 161025).  Rapid  decline  of  linkage  disequilibrium  with  distance  suggested  many  regions  included 
few  genes  each.  Genes  in  regions  associated  with  odds  of  infection  included  DPPA2/DPPA4  (empirical  P  = 
0.006)  and  SYTL3  (P  =  0.051).  Genes  in  regions  associated  with  control  of  infection  included  a  zinc  finger 
cluster  (ZNF192,  ZSCAN16  ZNF389,  and  ZNF165;  P  =  0.001),  C19orf42/TMEM38A  (P  =  0.047),  and 
DLGAP1  (P  =  0.092).  Conclusions/Significance:  These  associations  provide  targets  for  mutation  discovery 
in  sheep  susceptibility  to  OvLV.  Aside  from  TMEM154,  these  genes  have  not  been  associated  previously 
with  lentiviral  infection  in  any  species,  to  our  knowledge.  Further,  data  from  other  species  suggest 
functional  hypotheses  for  future  testing  of  these  genes  in  OvLV  and  other  lentiviral  infections.  Specifically, 
SYTL3  binds  and  may  regulate  RAB27A,  which  is  required  for  enveloped  virus  assembly  of  human 
cytomegalovirus.  Zinc  finger  transcription  factors  have  been  associated  with  positive  selection  for 
repression  of  retroviral  replication.  DLGAP1  binds  and  may  regulate  DLG1,  a  known  regulator  of  HIV 
infectivity. 


15.  SUBJECT  TERMS 

16.  SECURITY  CLASSIFICATION  OF: 

17.  LIMITATION  OF 
ABSTRACT 

Same  as 
Report  (SAR) 

18.  NUMBER 
OF  PAGES 

11 

19a.  NAME  OF 
RESPONSIBLE  PERSON 

a.  REPORT 

unclassified 

b.  ABSTRACT 

unclassified 

c.  THIS  PAGE 

unclassified 

Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


GWAS  Susceptibility/Control  of  Ovine  Lentivirus 


composite  breed  Polypay  were  intermediate  but  with  greater 
within-breed  variation  [10,13,15].  More  recently,  direct  measure¬ 
ments  of  integrated  provirus  have  confirmed  breed  differences  not 
only  in  odds  of  infection  but  also  relative  immune  control  of  viral 
levels  once  infected  [15].  Specifically,  Rambouillet  sheep  had 
lower  proviral  concentrations  than  Columbia  sheep,  and  Polypay 
were  again  intermediate  with  greater  within-breed  variation  [15]. 
Recently,  mutations  in  one  gene  (' TMEM154 :  Transmembrane  Protein 
154)  were  identified  that  contributed  to  relative  susceptibility  to 
OvLV  in  multiple  breed  backgrounds  and  widely  differing 
environmental  conditions  [16].  However,  the  known  mutations 
have  not  been  shown  to  confer  complete  resistance  to  OvLV,  and 
high  prevalence  has  been  observed  in  at  least  some  flocks 
containing  predominantly  the  less  susceptible  sheep  TMEM154 
genotype  [16].  Importantly,  these  and  other  genetic  variants,  once 
identified,  can  be  used  for  effective  marker-assisted  selection  even 
in  the  absence  of  pathogen  exposure,  which  could  greatly  improve 
animal  welfare  and  economic  concerns  when  breeding  for  reduced 
susceptibility  to  OvLV. 

Recent  developments  in  genotyping  technology  should  hasten 
identification  of  additional  genes  that  could  expand  marker- 
assisted  selection  in  sheep.  Sufficiently  dense  marker  systems 
(>50,000  single  nucleotide  polymorphisms  (SNP))  have  recently 
been  released  to  allow  examination  of  genome-wide  variants 
related  to  phenotypes  of  interest  [17],  These  genome-wide 
association  studies  (GWAS)  take  advantage  of  the  tendency  of 
chromosome  segments  to  be  inherited  together,  which  is  referred 
to  as  linkage  disequilibrium  (LD).  Specifically,  when  unobserved 
functional  variants  are  inherited  together  with  observed  SNP,  the 
observed  SNP  demonstrate  association  with  the  phenotype  of 
interest.  Further,  it  is  widely  acknowledged  that  association  in 
multiple  populations  greatly  enhances  the  value  of  genetic  markers 
because  it  suggests  that  the  markers  are  close  enough  to  causal 
mutations  to  be  in  population-wide  LD,  a  much  higher  standard 
than  association  in  only  one  population  [18].  Therefore,  this  study 
examined  genome-wide  association  with  both  susceptibility  to 
OvLV  and  control  of  OvLV  replication  in  multiple  breeds. 

Results 

Genotypes  and  Phenotypes 

From  the  original  set  of  997  animals,  sample  quality  control 
criteria  eliminated  18  samples  from  additional  analysis.  Multi¬ 
dimensional  scaling  (MDS)  identified  3  clusters  corresponding  to 
breed,  and  15  animals  were  eliminated  for  outlier  status  (Figure 
SI).  The  remaining  animals  included  Rambouillet  (N  =  399), 
Polypay  (N  =  423),  and  Columbia  (N  =  142).  Pairwise  population 
concordance  (PPC)  clusters  matched  breed  for  Columbia  and 
Polypay  but  resulted  in  subdividing  the  Rambouillet  breed  into  2 
clusters  (Rambouillet_l  and  Rambouillet_2).  These  clusters  were 
used  as  stratification  criteria  for  subsequent  association  analysis. 
Average  genotyping  call  rate  in  the  remaining  animals  was 
98.06%.  After  initial  screening  for  genotyping  call  rate  by 
individual  and  by  SNP,  minor  allele  frequency,  and  Hardy- 
Weinberg  equilibrium,  there  were  slightly  varying  numbers  of 
SNP  for  subsequent  analysis  depending  on  the  animal  set:  49,233 
(all  breeds),  50,275  (Rambouillet),  50,264  (Polypay),  and  44,258 
(Columbia). 

Of  the  964  animals  that  passed  initial  tests  for  genotype  call 
quality,  95%  had  serological  phenotypes  and  96%  had  proviral 
concentration  phenotypic  data.  The  remaining  blood  samples 
were  not  of  sufficient  quality  to  report  results.  Among  diose 
individuals  with  serological  results,  35%  were  positive  for  OvLV. 
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Decline  of  Linkage  Disequilibrium  with  Distance 

To  estimate  the  length  of  useful  LD  and  thus  provide  genome¬ 
wide  average  expectation  for  the  approximate  distance  from 
identified  SNP  within  which  underlying  mutations  might  be  found, 
LD  was  calculated  for  pairs  of  observed  SNP.  Decline  of  LD  with 
physical  distance  in  each  breed  is  shown  in  Figure  S2.  The  first 
distance  bin  exceeding  the  threshold  of  1000  observations  was  5— 
10  kilobase  pairs  (Kb),  and  this  bin  is  reported  in  Figure  S2  as 
10  Kb.  All  individual  breeds  had  r2<0.40  at  10  Kb  and  r2<0.25 
at  35  Kb  (Figure  S2).  The  combined  all-breeds  set  had  lower  LD 
of  r2<0.35  at  10  Kb,  r2<0.25  at  20  Kb,  and  r2<0.20  at  35  Kb 
(Figure  S2). 

Genome-wide  Association  with  Serological  Status 

Because  OvLV  infection  is  lifelong,  concordance  between 
serological  status  and  direct  viral  measures  of  infection  is  high 
and  serological  status  is  a  proxy  for  odds  of  infection  in  exposed 
populations  [19].  Genome-wide  association  with  serological  status 
yielded  13  unique  SNP,  including  3  genome-wide  significant  and 
10  genome-wide  suggestive  SNP  (Table  1).  For  each  SNP,  results 
from  the  best-fitting  mode-of-inheritance  association  model  were 
reported  in  Table  1.  Only  one  SNP  (OAR1_185953850)  was 
identified  in  separate  analyses  of  both  the  all-breeds  animal  set  and 
in  an  individual  breed  (Polypay).  After  accounting  for  TMEM154 
risk  status  as  previously  reported  [16],  the  Table  1  SNP  were 
associated  with  serologic  status  to  varying  degrees  except 
OAR17_5388531,  the  SNP  located  within  TMEM154  (Table  2). 
Further,  no  additional  SNP  became  genome-wide  suggestive  after 
accounting  for  TMEM154  risk  status. 

For  each  analysis,  a  second  run  was  performed  that  dropped 
screening  criteria  including  genotype  call  rate  by  individual  and  by 
SNP,  minor  allele  frequency,  and  Hardy-Weinberg  equilibrium. 
This  technique  had  the  potential  to  identify  associated  SNP  that 
failed  one  or  more  of  the  screening  criteria,  for  example  common 
homozygous  lethal  genomic  regions,  but  no  additional  genome¬ 
wide  suggestive  SNP  were  identified.  No  SNP  reached  genome¬ 
wide  suggestive  association  in  the  Columbia  analysis. 

A  Manhattan  plot  showing  P-values  arranged  by  chromosome 
position  is  shown  in  Figure  1 .  A  series  of  Quantile-Quantile  (Q-Q 
plots  showing  observed  versus  expected  P-value  distributions  are 
shown  in  Figures  S3,S4,S5.  Initial  analysis  showed  population 
stratification  (Figure  S3),  but  most  of  diis  apparent  population 
stratification  was  eliminated  by  accounting  for  SNP  or  genes 
identified  in  Table  1  (Figures  S4,S5). 

Genome-wide  Association  with  Proviral  Concentration 

The  concentration  of  provirus  in  peripheral  blood  was  used  as  a 
measure  of  control  of  infection,  and  it  has  been  shown  to  be 
correlated  with  OvLV  pathology  [20].  Genome-wide  association 
with  proviral  concentration  identified  13  SNP,  including  2 
genome-wide  significant  and  1 1  genome-wide  suggestive  SNP 
(Table  3).  For  each  SNP,  results  from  the  best-fitting  mode-of- 
inheritance  association  model  were  reported  in  Table  3.  The 
genotypic  mean  proviral  concentrations,  adjusted  for  age  and 
breed,  were  reported  in  Table  SI. 

For  each  analysis,  a  second  run  was  performed  that  dropped 
screening  criteria  including  genotype  call  rate  by  individual  and  by 
SNP,  minor  allele  frequency,  and  Hardy-Weinberg  equilibrium. 
This  technique  had  the  potential  to  identify  associated  SNP  that 
failed  one  or  more  of  the  screening  criteria,  but  no  additional 
genome-wide  suggestive  SNP  were  identified.  The  Columbia 
animal  set  included  fewer  animals  with  many  fewer  polymorphic 
SNP,  and  results  from  association  with  proviral  concentration  in 
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Table  1.  Genomic  regions  associated  with  susceptibility  to  OvLV. 


SNP 

Chr 

Position  (bp) 

Animal  Set 

Best  fitting 
model 

Nominal 

P-value 

EmpiricalP- 

value 

Odds 

Ratio 

Genes  within  100  Kb  on  either 
side 

OAR1_1 85953850 

1 

172,600,491 

All 

additive 

3.3x10~8 

0.006 

1.98 

DPPA2**,  DPPA4* 

OAR1_1 85953850 

1 

172,600,491 

Polypay 

additive 

6.5x10  7 

0.048 

2.60 

DPPA2**,  DPPA4* 

OAR1_1 86779231 

1 

173,437,685 

Polypay 

dominant 

1 .7 xIO7 

0.012 

4.45 

- 

OAR4_38205790 

4 

35,398,410 

All 

dominant 

3.1  x10~6 

§ 

2.84 

- 

S5451 1 

6 

12,045,872 

Polypay 

additive 

3.8x10  6 

§ 

2.36 

CAMK2D** 

OAR7_82644472 

7 

75,607,567 

Polypay 

additive 

5.7x10  6 

§ 

2.53 

GPHN** 

OAR8_73555614 

8 

68,927,958 

All 

recessive 

8.7x10  6 

§ 

2.61 

UTRN*,  STX1 1 

OAR8_88021 348 

8 

82,158,519 

Rambouillet 

recessive 

4.2x10  6 

0.051 

4.46 

SYTL3**,  GTF2H5**,  DYNLT1*, 
TMEM181,  EZR 

OAR9_35880400 

9 

34,065,208 

Polypay 

additive 

2.4x10  6 

§ 

2.46 

ST18 

OAR1 7_5388531 

17 

4,862,358 

All 

additive 

9.2x10  7 

0.13 

7.57 

TMEM154** 

si  9031 

17 

12,016,817 

All 

additive 

6.4x10~6 

§ 

1.73 

- 

OAR18_9395406 

18 

9,449,325 

All 

genotypic 

2.1  x10~6 

§ 

2.74 

FUS* 

OAR20_1 9572554 

20 

18,822,181 

All 

additive 

2.0x10  6 

§ 

1.80 

SUPT3H**,  RUNX2** 

S56930 

X 

22,443,790 

All 

genotypic 

7.2x10  6 

§ 

2.43 

- 

5:  P>0.1 5 

**:  SNP  located  within  gene 

*:  SNP  located  within  35  Kb  of  gene 

doi:10.1371/journal.pone.0047829.t001 


Columbia  sheep  were  reported  in  Table  S2.  Genotype  frequencies 
of  reported  SNP  were  included  in  Table  S3. 

A  Manhattan  plot  showing  P-values  arranged  by  chromosome 
position  is  shown  in  Figure  2.  A  series  of  Quantile-Quantile  (Q;Q) 
plots  showing  observed  versus  expected  P-value  distributions  are 


shown  in  Figures  S6,S7.  The  initial  analysis  showed  apparent 
population  stratification  (Figure  S6),  but  most  of  this  apparent 
population  stratification  was  eliminated  by  accounting  for  SNP  in 
Table  3  (Figure  S7). 


Table  2.  Genomic  regions  associated  with  OvLV  serologic  status  comparing  association  with  and  without  accounting  for 
TMEM154  mutations. 


SNP 

Animal  Set 

Nominal 

P-value  (Without 
TMEM154) 

Nominal 

P-value 
(Accounting 
TMEM1 54) 

Empirical 

P-value 

(Without 

TMEM154) 

Empirical 

P-value 

(Accounting 

TMEM154) 

Genes  within  100  Kb  on 
either  side 

OAR1_1 85953850 

All 

3.3x1(T8 

5.2x10~6 

0.006 

§ 

DPPA2**,  DPPA4* 

OAR1_1 85953850 

Polypay 

6.5x1(T7 

3.8x10~5 

0.048 

§ 

DPPA2**,  DPPA4* 

OAR1_1 86779231 

Polypay 

1.7  x10~7 

4.5x10~6 

0.012 

§ 

- 

OAR4_38205790 

All 

3.1  x1(T6 

I.OxlO-4 

§ 

§ 

- 

S54511 

Polypay 

3.8x10~6 

6.7  x10~4 

§ 

§ 

CAMK2D** 

OAR7_82644472 

Polypay 

5.7  x10~6 

3.0x10~5 

§ 

§ 

GPHN** 

OAR8_73555614 

All 

8.7  x10~6 

5.8x10~5 

§ 

§ 

UTRN*,  STX1 1 

OAR8_88021 348 

Rambouillet 

4.2x1(T6 

3.2  xIO-6 

0.051 

0.052 

SYTL3**,  GTF2H5**,  DYNLT1*, 
TMEM181,  EZR 

OAR9_35880400 

Polypay 

2.4x10~6 

1.1  x10~5 

§ 

§ 

ST18 

OAR17_5388531 

All 

9.2x10~7 

§ 

0.13 

§ 

TMEM154** 

si  9031 

All 

6.4x10~6 

i.oxicr4 

§ 

§ 

- 

OAR18_9395406 

All 

2.1  x1(T6 

1.1  x10~6 

§ 

§ 

FUS* 

OAR20_1 9572554 

All 

2.0x10~6 

3.4  x10~6 

§ 

§ 

SUPT3H**,  RUNX2** 

S56930 

All 

7.2x10~6 

8.5x10~5 

§ 

§ 

s:  P>0.15 

**:  SNP  located  within  gene 
*:  SNP  located  within  35  Kb  of  gene 
doi:1 0.1 371/journal. pone.0047829.t002 
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Chromosome 

Figure  1.  Manhattan  plot  for  susceptibility  to  OvLV.  The  Manhattan  plot  shows  nominal  P-values  from  association  with  serological  status  by 
chromosomal  position.  Representative  data  from  the  all-breeds,  additive  mode  of  inheritance  analysis  are  shown.  The  top  red  line  shows  a  genome¬ 
wide  significance  threshold  defined  by  nominal  P-values  of  IxlO"6,  which  is  P  =  0.05/50,000.  The  lower  blue  line  shows  a  genome-wide  suggestive 
significance  threshold  defined  by  IxlO"5. 
doi:1 0.1 371  /journal. pone.0047829.g001 


Discussion 

Breed  differences  in  susceptibility  to  OvLV  have  long  suggested 
genetic  involvement,  and  a  recent  study  identified  TMEM154 
mutations  as  consistently  associated  with  OvLV  prevalence  [16]. 
However,  those  mutations  did  not  confer  complete  resistance,  and 
some  flocks  with  a  high  frequency  of  the  less  susceptible  genotypes 
nonetheless  have  high  OvLV  prevalence.  This  GWAS  identified 
multiple  additional  genomic  regions  as  associated  with  OvLV 
susceptibility.  Further,  this  was  the  first  GWAS  to  examine  control 
of  OvLV  replication  to  our  knowledge,  and  multiple  genomic 
regions  were  identified  as  associated  with  control  of  OvLV 
replication.  The  short  LD  length  observed  (Figure  S2)  was 
consistent  with  other  reports  in  sheep  [17]  and  implicated  a 
smaller  number  of  genes  within  each  associated  region  than  in 
many  other  mammal  species  [21,22,23,24,25].  For  each  region 
associated  with  either  susceptibility  or  control  of  OvLV,  the 
association  suggested  one  or  more  underlying  variants  within  the 
region  had  a  functional  relationship  with  some  aspect  of  OvLV 
infection,  replication,  or  transmission.  Some  regions  contained 
promising  candidate  genes  that  suggested  further  biological 
hypotheses  for  specific  gene  product  involvement.  Below,  we 
summarize  major  results  as  defined  by  empirical  P-values  and 
discuss  future  hypotheses  of  function  to  test  for  genes  identified  in 
connection  with  odds  of  infection  and  then  for  genes  identified  as 
associated  with  OvLV  control  of  infection. 

Genes  involved  in  odds  of  infection  could  be  used  to  select  sheep 
with  lower  probabilities  of  natural  infection,  and  the  top  GWAS 
markers  provide  important  evidence  for  the  involvement  of  several 
interesting  genes.  Only  one  gene  had  been  identified  previously  as 


consistently  associated  with  susceptibility  to  OvLV  infection  [16], 
and  this  GWAS  also  confirmed  the  association  with  TMEM154 
(empirical  P  =  0. 13;  Table  1).  Further,  the  odds  ratio  was  estimated 
at  7.57  (Table  1),  which  is  consistent  with  prior  data  [16].  A  recent 
study  identified  TMEM154  in  a  human  GWAS  for  asthma  severity 
[26],  suggesting  that  TMEM154  may  play  a  conserved  role  in 
airway  immune  responses.  Only  markers  observed  at  empirical 
significance  exceeding  the  TMEM154  region  harboring  known 
mutations  (empirical  P<0.10)  will  be  discussed  further. 

The  locus  most  highly  associated  with  odds  of  infection  was  a 
SNP  in  the  DPPA2  ( Developmental  Pluripotency  Associated  2)  gene 
which  was  associated  in  both  die  all-breeds  analysis  (empirical 
P  =  0.006;  Table  1)  and  in  Polypays  analyzed  separately  (empirical 
P  =  0.048;  Table  1).  DPPA2  and  the  closely  related  DPPA4 
(. Developmental  Pluripotency  Associated  4)  gene  are  expressed  in  embryo 
and  germ  line  cells  [27],  and  there  are  many  potential  mechanisms 
by  which  DPPA2  or  DPPA4  may  be  involved  in  OvLV  infection. 
Both  DPPA2  and  the  nearby  DPPA4  play  essential  roles  in  lung 
development  and  formation  [28,29],  suggesting  differential  lung 
development  in  sheep  may  play  a  role  in  OvLV  susceptibility. 
However,  OvLV  has  been  found  in  semen  and  in  the  female 
reproductive  tract  of  sheep  [30,31],  and  the  possibility  of  sexual 
transmission  and/ or  paternal  transmission  of  the  virus  are 
important  unanswered  questions.  If  OvLV  is  sexually  or  paternally 
transmitted,  germline  expressed  DPPA2  may  play  a  role  in  some 
aspect  of  transmission.  A  third  possibility  is  differential  develop¬ 
ment  of  the  immune  system.  Since  DPPA2  is  expressed  during 
embryonic  development,  it  is  possible  that  DPPA2  variants  may 
influence  the  development  or  maintenance  of  the  immune  system. 
The  SheepQTLdb  [32,33]  shows  a  QTL  for  Haemonchus  contortus 
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Figure  2.  Manhattan  plot  for  control  of  OvLV  infection.  The  Manhattan  plot  shows  nominal  P-values  from  association  with  proviral 
concentration  by  chromosomal  position.  Representative  data  from  the  Rambouillet,  dominant  mode  of  inheritance  analysis  are  shown.  The  top  red 
line  shows  a  genome-wide  significance  threshold  defined  by  nominal  P-values  of  1x1  O'6,  which  is  P  =  0.05/50,000.  The  lower  blue  line  shows  a 
genome-wide  suggestive  significance  threshold  defined  by  nominal  P-values  of  IxlO"5. 
doi:1 0.1 371  /journal. pone.0047829.g002 


fecal  egg  count  [34]  that  overlaps  this  region.  An  intriguing 
possibility  would  involve  DPPA2  influencing  both  OvLV  and 
parasitic  infection  through  alterations  in  immune  system  develop¬ 
ment,  since  alterations  in  embryonic  and/or  white  blood  cell 
development  could  influence  formation  and/or  maintenance  of 
immune  responses. 

A  second  locus  associated  with  OvLV  susceptibility  after 
correction  for  multiple  testing  was  identified  in  Polypays 
(OARl_186779231;  empirical  P  =  0.0 1 2)  and  was  located 
837  Kb  distal  on  ovine  chromosome  1  from  the  SNP  in  DPPA2 
(Table  1).  This  distance  is  considerably  further  than  the  genome¬ 
wide  average  decline  of  LD  would  suggest  (Figure  S2),  but  there  is 
nonetheless  appreciable  LD  between  the  markers  (Table  S4).  In 
Polypays,  these  two  SNP  had  r2  =  0.40,  and  in  all  breeds  examined 
these  SNP  had  r2  =  0.23  (Table  S4).  It  is  possible  that  the 
associations  of  both  SNP  reflect  a  single  underlying  gene  through 
long-range  LD  due  to  admixture  in  the  composite  breed  formation 
of  the  Polypay  population  [35,36].  Alternatively,  a  separate 
associated  genetic  factor  could  be  present,  though  the  current 
version  of  the  sheep  genome  contains  no  annotated  genes  near 
OARl_186779231.  Nonetheless,  the  example  of  the  callipyge 
muscular  hypertrophy  in  sheep  demonstrates  that  even  a  single 
nucleotide  change  in  a  gene  desert  can  have  a  dramatic  impact 
[37,38], 

A  separate  region  associated  with  odds  of  infection  included 
STTL3  ( Synaptotagmin-Like  3)  on  ovine  chromosome  8,  and  this 
region  had  the  second  largest  odds  ratio  after  TMEM154  (Table  1). 
SYTL3  is  a  peripheral  membrane  protein  that  interacts  with 
RAB27A  (RAB27A,  member  RAS  oncogene  family)  and  is  thought  to 
play  a  role  in  vesicular  trafficking  [39,40].  To  our  knowledge  there 


has  been  no  report  of  STTL3  associated  with  any  lentivirus 
infection  in  any  mammal,  but  RAB27A  is  required  for  enveloped 
virus  assembly  of  human  cytomegalovirus  [41].  STT13  may 
regulate  RAB27A  and  thereby  interfere  with  viral  assembly. 
Further,  RAB27A  is  a  known  negative  regulator  of  phagocytosis 
[42],  and  is  involved  in  exosome  synthesis  [43].  As  such,  SYTL3 
could  also  influence  either  of  those  processes.  While  the  current 
version  of  the  sheep  genome  also  shows  SNP  OAR8_8802 1 348 
within  transcriptional  regulator  GTF2H5  ( General  Transcription 
Factor  1IH,  polypeptide  5 ),  the  gene  order  does  not  agree  with  other 
mammal  genomes  including  cow,  human,  dog,  and  mouse,  and  it 
may  well  be  a  mis-assembly  in  the  current  version  of  the  sheep 
genome. 

A  further  analysis  of  genomic  regions  associated  with  serological 
status  incorporated  data  on  risk  status  defined  by  known 
TMEM154  diplotypes  [16].  In  this  analysis,  SNP  near  DPPA2 / 
DPPA4  dropped  from  genome-wide  significant  to  genome- wide 
suggestive,  but  the  SNP  in  SVTL3  retained  its  level  of  significance 
(Table  2).  This  strengthened  the  case  for  association  between 
serological  status  and  an  underlying  variant  near  SYTL3.  It  is 
possible  that  part  of  the  association  between  serological  status  and 
SNP  in  or  near  DPPA2/DPPA4  was  due  to  random  association 
between  genotypes  at  TMEM154  and  DPPA2/DPPA4.  However, 
this  type  of  analysis  tested  association  of  SNP  in  or  near  the 
DPPA2/DPPA4  region  averaged  over  all  genotypes  of  TMEAI154. 
Thus,  it  could  underrepresent  association  in  the  presence  of 
epistasis,  where  differential  association  exists  at  one  locus  by 
genotype  at  another.  However,  the  relatively  small  overlap 
between  genotype  frequencies  of  SNP  near  DPPA2/DPPA4  and 
risk  diplotypes  oPTAlEMl  54  suggested  a  larger  sample  including 
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additional  animals  would  be  required  to  perform  a  thorough  test 
for  genetic  interaction. 

As  well  as  adding  more  genomic  regions  associated  with  odds  of 
infection,  this  was  the  first  genome-wide  study  to  examine 
association  with  the  control  of  viral  replication  as  measured  by 
proviral  concentration,  to  our  knowledge.  The  most  significant 
SNP  (s65956;  empirical  P  =  0.001)  was  located  in  a  cluster  of  zinc 
finger  genes  on  ovine  chromosome  20  (Table  3).  Zinc  finger 
proteins  are  known  to  have  undergone  duplication  and  divergent 
positive  selection  in  the  DNA-binding  zinc  finger  domain 
following  challenge  by  new  retroviruses  during  mammalian 
evolution  [44,45].  This  suggests  the  hypothesis  that  certain  zinc 
finger  proteins  might  interfere  with  lentiviral  replication,  and 
indeed  that  has  recently  been  shown  with  the  ^trw  Finger  Antiviral 
Protein  (ZAP)  and  HIV- 1  [46] .  However,  to  our  knowledge  none  of 
the  zinc  finger  proteins  in  the  cluster  identified  here  has  ever  been 
shown  to  be  associated  with  any  mammalian  lentiviral  infection. 

Another  empirically  significant  SNP  (s27054;  P  =  0.047)  was 
located  within  both  C19orf42  ([human]  Chromosome  19  Open  Reading 
Frame  19)  and  TMEM38A  ( Transmembrane  Protein  38A)  (Table  3). 
Not  much  is  known  about  either  C19orf42  or  TMEM38A  at  this 
time.  C19orf42  is  a  short  (75  amino  acid)  open  reading  frame  with 
similarity  to  Yosl,  a  yeast  protein  required  for  transport  between 
endoplasmic  reticulum  and  Golgi  complex  [47].  As  such,  C19orf42 
could  be  involved  in  efficiency  of  viral  packaging,  but  data  to 
demonstrate  such  involvement  are  currently  lacking.  TMEM38A 
encodes  a  protein  associated  with  vascular  smooth  muscle  control 
of  blood  pressure  [48].  It  is  possible  that  TMEM38A  could  be 
involved  by  an  as-yet  undetermined  mechanism.  Much  work 
remains  to  be  done  to  elucidate  the  potential  roles  of  these  genes  in 
limiting  OvLV  replication. 

Finally,  there  were  two  additional  regions  with  suggestive 
empirical  association  (P<0. 1 0)  with  proviral  concentration.  The 
first  included  two  SNP  on  ovine  chromosome  9  in  Polypays 
(Table  3),  but  no  genes  are  annotated  in  that  region  in  the  current 
pre-release  version  of  the  sheep  genome.  The  other  is  SNP 
OAR23_40410527  located  within  the  DLGAP1  ( Discs  Large 
(Drosophila)  Homolog-Associated  Protein  1)  gene  (Table  3).  The  product 
of  DLGAP1  is  a  primary  interacting  protein  of  DLG1  and  is 
believed  to  function  by  holding  DLG1  in  place  at  cell  junctions 
[49,50,51].  DLG1  is  a  strong  negative  regulator  of  HIV-1 
infectivity,  such  that  depletion  of  DLG1  enhances  HIV  infectivity 
5—6  fold  [52].  The  allelic  variant(s)  of  DLGAP1  in  sheep  may 
physically  interfere  with  DLG1  positioning  and/ or  function. 
Though  DLGAP1  was  associated  with  mouse  survival  following 
influenza  H5N1  infection  [53],  possibly  mediated  through  its 
interaction  with  DLG1  [54],  to  our  knowledge  DLGAP1  has  not 
been  reported  to  have  a  role  in  lentiviral  infection  in  any  mammal 
to  date. 

In  conclusion,  this  GWAS  found  many  additional  genomic 
regions  besides  TMEM154  associated  with  odds  of  infection,  and 
this  was  the  first  genome-wide  study  to  provide  regions  associated 
with  OvLV  control  as  measured  by  proviral  concentration.  These 
results  confirmed  that  TMEM154  is  an  important  component  of 
host  susceptibility  to  OvLV,  but  there  were  many  additional 
factors  that  contributed  to  genetic  differences  in  susceptibility  to 
and  control  of  OvLV  infection.  These  included  many  genes  never 
previously  associated  with  lentiviral  infection,  which  may  extend 
structural  and/ or  regulatory  networks  implicated  in  lentiviral 
transmission,  infection,  and/ or  replication.  These  genes  provide 
targets  for  additional  investigation  into  lentiviral  infection  that  may 
generalize  beyond  OvLV  to  other  members  of  the  lentiviral  family. 
Further,  the  associated  genes  provide  the  basis  for  additional  work 
to  identify  genetic  markers  associated  with  odds  of  OvLV  infection 
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and  proviral  concentration.  As  has  already  been  done  with 
TMEM154 ,  such  markers  may  be  used  to  select  sheep  with  lower 
odds  and/or  improved  control  of  OvLV  infection.  Finally,  they 
may  also  be  of  great  interest  for  further  study  in  goats,  which  are 
host  to  the  closely  related  caprine  encephalitis  arthritis  virus. 

Materials  and  Methods 

Ethics  Statement 

All  animal  care  and  handling  procedures  were  reviewed  and 
approved  by  the  Washington  State  University  Institutional  Animal 
Care  and  Use  Committee  (Permit  Number:  3171)  and/or  by  the 
U.S.  Sheep  Experiment  Station  Animal  Care  and  Use  Committee 
(Permit  Numbers:  10-06,  10-07).  All  efforts  were  made  to 
minimize  any  discomfort  during  blood  sampling. 

Populations  and  Phenotypes 

Flocks  were  chosen  for  study  with  high  historical  OvLV 
seroprevalence  among  mature  ewes  [15].  Specifically,  whole  blood 
was  collected  from  ewes  of  Rambouillet  (N  =  4 1 4),  Polypay 
(N  =  438),  and  Columbia  (N  =  145)  breeds,  ages  1-5  years,  from 
the  U.S.  Sheep  Experiment  Station.  These  animals  were  managed 
similarly  but  bred  separately  in  pure  breed  groups.  Blood  was 
processed  for  serum  and  peripheral  blood  leukocytes  as  previously 
described  [19].  A  subset  of  365  older  ewes  (ages  3-5)  had  been 
used  as  validation  animals  in  a  previous  study  on  TMEM154  [16]. 
Serologic  data  were  collected  using  a  competitive  ELISA  assay 
(VMRD  Inc.,  Pullman,  WA)  to  detect  anti-OvLV  antibodies  in 
sheep  [55].  DNA  was  extracted  from  peripheral  blood  leukocytes 
and  OvLV  proviral  concentrations  were  determined  by  a  validated 
qPCR  method  [19]. 

Genotyping 

Blood  was  collected  by  jugular  venipuncture  into  EDTA-coated 
vacutainer  tubes.  DNA  was  isolated  using  the  Invitrogen 
GeneCatcher™  gDNA  3-10  ml  Blood  Kit  as  per  manufacturers’ 
instructions  (Life  Technologies,  Carlsbad,  CA).  The  DNAs  were 
checked  for  quality  and  quantity  using  an  ND-1000  spectropho¬ 
tometer  (Nanodrop,  Wilmington,  DE)  and  equilibrated  to  50  ng/ 
pi  for  genotyping.  Genotyping  services  were  provided  by  Geneseek 
Inc.  (Lincoln,  NE)  using  the  OvineSNPSO  Infinium  BeadChip 
(Illumina  Inc.,  San  Diego,  CA)  with  a  set  of  54,977  SNP  designed 
by  the  International  Sheep  Genome  Consortium  [17], 

Statistical  analysis 

Genotypic  quality  control  and  clustering.  Initial  quality 
control  criteria  included  low  genotype  call  rates  and  high  genotype 
identity  to  any  other  sample.  Samples  with  low  genotype  call  rates 
(<97%)  were  removed  from  additional  analysis.  PLINK  identity- 
by-state  analysis  identified  samples  that  could  have  been  involved 
in  label  errors  (as  determined  by  IBS  distances  >0.95)  for  removal 
front  additional  analysis.  Multidimensional  scaling  (MDS)  was 
performed  in  the  PLINK  software  package  (http://pngu.mgh. 
harvard.edu/purcell/plink/)  written  by  Shaun  Purcell  [56]  to 
check  for  genotypic  outliers.  MDS  was  performed  on  a  reduced  set 
of  markers  in  approximate  linkage  equilibrium  from  animals  of 
each  breed  separately.  Outlier  animals  were  removed  to  reduce 
population  stratification.  Then,  a  pairwise  population  concor¬ 
dance  test  (-ppc  option)  of  the  hypothesis  that  each  animal  was 
front  a  separate  breeding  population  was  performed  using  a  P- 
value  of  10  ,  which  is  P  =  0.05  after  Bonferroni-correction  for  the 

total  number  of  paiiwise  comparisons  between  animals.  Where  the 
resulting  clusters  contained  additional  information  compared  to 
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recorded  breed  data,  the  clusters  were  included  in  addition  to 
breed  as  stratification  categories  for  later  association  analyses. 

Linkage  Disequilibrium  Versus  Genomic 

Distance.  Analysis  of  high  density  SNP  data  was  performed 
using  PLINK  vl.06  [56].  A  preliminary  screen  was  performed  to 
eliminate  SNP  with  minor  allele  frequencies  less  than  0.10  from 
comparisons  used  to  calculate  average  LD.  LD  within  each  breed 
was  calculated  with  settings  to  include  comparisons  of  all  SNP 
within  5  Mbp  of  each  other  and  minimum  reported  r2  adjusted  to 
0.0  to  include  even  low  LD  comparisons.  These  r2  values  were 
analyzed  by  distance  bins  with  SAS  9.2  (SAS  Institute,  Cary,  NC). 
Distance  bins  were  calculated  for  each  5  Kbp  interval  beginning 
with  0-5  Kbp.  Mean  r2  was  only  included  in  analysis  if  a  bin 
contained  at  least  1000  values  to  assure  each  reported  mean  was 
well-estimated. 

Association  analysis.  The  OvLV  positive  or  negative  status 
as  measured  by  cELISA  was  analyzed  using  separate  logistic 
models  for  the  minor  allele  of  each  SNP  in  PLINK  to  account  for 
breed  and  pairwise  population  concordance  clusters,  for  animal 
age  as  a  covariate,  and  for  the  SNP  minor  allele.  Logio- 
transformed  proviral  concentrations  were  analyzed  using  similar 
separate  general  linear  models  for  the  minor  allele  of  each  SNP  in 
PLINK  to  account  for  breed  and  pairwise  population  concordance 
clusters,  for  animal  age  as  a  covariate,  and  for  the  SNP  minor 
allele.  For  both  logistic  and  linear  analyses,  PLINK  screening 
criteria  were  employed  including  missingness  by  individual  (0.1), 
missingness  by  marker  (0.03),  minor  allele  frequency  (0.01),  and 
Hardy- Weinberg  equilibrium  (0.000001,  which  is  P  =  0.05  Bon- 
ferroni-corrected  for  50,000  SNP  tests).  Genome-wide  significance 
was  defined  by  empirical  P<0.05.  Genome-wide  suggestive  results 
were  defined  by  nominal  P-values  <  lxlO-’  [57].  Both  logistic 
and  linear  analyses  were  performed  using  association  models 
including  additive  allelic,  genotypic  2  degree-of-freedom,  domi¬ 
nant,  and  recessive.  Family  structure  was  addressed  for  both 
logistic  and  general  linear  models  using  permutation  within  sire 
families  of  10  or  more  genotyped  offspring;  the  remaining  sire 
families  with  fewer  than  10  offspring  were  grouped  together  for 
permutation  purposes.  One  thousand  permutations  were  used  to 
obtain  each  empirical  P-value.  Additional  analyses  of  serologic 
data  included  the  presence  or  absence  of  risk  haplotypes  as  defined 
by  Heaton  et  al.  [16]  as  covariates  in  the  respective  association 
models.  Visualization  of  association  data  in  manhattan  and 
quantile-quantile  plots  was  performed  using  a  script  generously 
provided  by  Dr.  Stephen  Turner  (http://gettinggeneticsdone. 
blogspot.com/20 1 1  / 04/ annotated-manhattan-plots-and-qq-plots. 
html,  viewed  on  11-15-11)  using  the  R  environment  [58].  Since 
PLINK  only  reports  regression  coefficients  as  a  measure  of  effect 
size  for  linear  regression,  SAS  9.2  (SAS  Institute,  Cary,  NC)  was 
used  to  run  similar  genotypic  models  in  the  general  linear  models 
procedure  to  obtain  largest  adjusted  genotypic  mean  differences  in 
log  io — transformed  proviral  concentration  as  a  measure  of  effect 
size  for  control  of  OvLV. 

Supporting  Information 

Figure  SI  Multidimensional  scaling  of  genotypes  show¬ 
ing  3  breed  clusters  of  animals  included  in  analysis. 

Columbias  are  included  in  the  top  cluster,  Polypays  in  the  bottom 
right  cluster,  and  Rambouillets  in  the  bottom  left  cluster.  The 
clustering  of  individuals  by  breed  is  clear  even  from  these  related 
breeds.  The  Columbia  breed  was  developed  pre-1920  with  'A 
Rambouillet  composition  [59] .  The  Polypay  breed  was  developed 
in  the  1970s  with  lA  Rambouillet  composition  [36]. 

(PDF) 
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Figure  S2  Decline  of  linkage  disequilibrium  with  dis¬ 
tance  by  breed. 

(PDF) 

Figure  S3  Quantile-Quantile  plot  for  odds  of  infection. 

Quantile-quantile  plots  from  association  with  serological  status, 
where  the  red  line  shows  the  expected  distribution.  Representative 
data  from  the  all-breeds,  additive  mode  of  inheritance  analysis  are 
shown.  The  results  show  deviation  from  the  expected  distribution 
indicating  population  stratification  by  factors  unaccounted  in  the 
analytic  model,  which  could  include  frequencies  of  underlying 
mutations  for  susceptibility  loci  that  differ  between  seropositive 
and  seronegative  individuals. 

(PDF) 

Figure  S4  Quantile-Quantile  plot  for  odds  of  infection 
conditioned  on  TMEM154  risk  status.  A  second  analysis 
conditioned  on  TMEM154  risk  status  shows  an  observed  distribu¬ 
tion  closer  to  expected  than  the  primary  analysis,  but  still  does  not 
account  for  the  majority  of  apparent  population  stratification. 

(PDF) 

Figure  S5  Quantile-Quantile  plot  for  odds  of  infection 
conditioned  on  Table  1  SNP.  A  third  analysis  conditioned  on 
all  the  SNP  in  Table  1  shows  a  distribution  much  closer  to 
expected,  demonstrating  that  host  genetic  factors  tracked  by  these 
SNP  account  for  the  majority  of  apparent  population  stratification. 
(PDF) 

Figure  S6  Quantile-Quantile  plot  for  control  of  viral 
replication.  Quantile-quantile  plot  from  association  with 
proviral  concentration,  where  the  red  line  shows  the  expected 
distribution.  Representative  data  from  the  Rambouillet,  dominant 
mode  of  inheritance  analysis  are  shown.  The  results  show 
deviation  from  the  expected  distribution  indicating  population 
stratification  by  factors  unaccounted  in  the  analytic  model,  which 
could  include  frequencies  of  underlying  mutations  for  susceptibility 
loci  that  differ  by  proviral  concentration. 

(PDF) 

Figure  S7  Quantile-Quantile  plot  for  control  of  viral 
replication  conditioned  on  Table  3  SNP.  A  second  analysis 
was  performed  by  conditioning  on  all  the  SNP  in  Table  3,  minus 
close  equivalents  on  the  same  chromosome  (r2>0.8;  removed  to 
prevent  inestimable  multicollinearity)  for  which  only  the  best  P- 
value  SNP  was  retained  from  each  pair.  This  analysis  shows  a 
distribution  much  closer  to  expected,  demonstrating  that  host 
genetic  factors  tracked  by  these  SNP  account  for  the  majority  of 
apparent  population  stratification. 

(PDF) 

Table  SI  Adjusted  genotypic  mean  proviral  concentra¬ 
tions  for  Table  3  SNP.  Mean  proviral  concentrations  by 
genotype,  adjusted  for  age  and  breed. 

(DOC) 

Table  S2  Genomic  regions  from  Columbia  breed  asso¬ 
ciated  with  proviral  concentration  of  ovine  lentivirus. 

Genomic  regions  from  Columbia  breed  associated  with  proviral 
concentration  of  ovine  lentivirus. 

(DOC) 

Table  S3  Genotype  frequencies  by  animal  set  for  SNP 
from  Tables  1,  2,  and  S2. 

(XLSX) 

Table  S4  Linkage  disequilibrium  by  animal  set  between 
SNP  from  Tables  1,  2,  and  S2  by  chromosome. 

(XLSX) 
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